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Abstract 


Background: Evaluating the toxicity or effectiveness of two or more toxicants in a specific 
population often requires specialized statistical software to calculate and compare median 
lethal doses (LDsos). Tests for equality of LDsos using probit regression with parallel slopes 
have been implemented in many software packages, while tests for cases of arbitrary slopes 
are not generally available. 

Methods: In this study, we established probit-log(dose) regression models and solved them 
by the maximum likelihood method using Microsoft Excel. The z- and 7?-tests were used to 
assess significance and goodness of fit to the probit regression models, respectively. We 
calculated the lethal doses (LDs) of the toxicants at different significance levels and their 95% 
confidence limits (CLs) based on an accurate estimation of log(LD) variances. We further 
calculated lethal dose ratios and their 95% CLs for two examples without assuming parallel 
slopes following the method described by Robertson, et al., 2017. 

Results: We selected representative toxicology datasets from the literature as case studies. 
For datasets without natural responses in the control group, the slopes, intercepts, y? statistics 
and LDs calculated using our method were identical to those calculated using Polo-Plus and 
SPSS software, and the 95% CLs of the lethal dose ratios between toxicants were close to 
those calculated using Polo-Plus. For datasets that included natural responses in the control 
group, our results were also close to those calculated using Polo-Plus and SPSS. 

Conclusion: This procedure yielded accurate estimates of lethal doses and 95% CLs at 
different significance levels as well as the lethal dose ratios and 95% CLs between two 
examples. The procedure could be used to assess differences in the toxicities of two examples 


without the assumption of parallelism between probit-log(dose) regression lines. 


Keywords: Toxicity; Probit regression; Lethal dose ratio; Maximum likelihood 
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Background 

In toxicological, entomological and environmental studies, doses of toxicants that kill a 
defined proportion of organisms, e.g., the median lethal dose (LDs0) which kills 50% of the 
population, are typically used as indicators of acute toxicity. Comparing the activities of 
different toxicants in a specific population or determining the relative susceptibilities of 
different populations to a single toxicant are common research goals. The relative potency, 
which assumes that the regression lines of the two toxicants being compared are parallel, 
provides a convenient comparison of the toxicities of two toxicants [1]. 

However, in practice, many regression lines are not parallel, particularly those derived 
from bioassays of toxicants with different modes of action, or from same-action toxicants 
administered to populations with different resistance levels. The 95% confidence limits (CLs) 
of a lethal dose ratio can be calculated by estimating the slopes and intercepts of two probit 
regression lines and constructing their variance and covariance matrices. The 95% CLs of this 
ratio indicate whether the lethal doses of the two toxicants are statistically different from one 
another [2]. Polo-Plus software, developed by Robertson et al. [3], separately analyzes the 
data for each substance using probit or logit models based on the joint probability of all 
observations and calculates lethal dose ratios and their CLs at different significance levels. 
IBM SPSS provides solution to calculate the lethal doses with 95% CLs based on probit or 
logit models, and also the relative median potency (RMP) assuming that the two regression 
lines are parallel [4]. 

In this study, we calculated lethal doses and 95% CLs of toxicants at different 
significance levels, as well as the lethal dose ratio and its 95% CLs for two toxicants, from 
probit-log(dose) regression models constructed using the maximum likelihood method in 
Microsoft Excel. The effectiveness of this method was compared with that of Polo-Plus and 


IBM SPSS. 


Methods 


Construction of probit-log(dose) regression models for a single toxicant or 
population 

For a population treated with serial doses (7) of a toxicant, in which n subjects were 
treated and r subjects exhibited a characteristic response to each dose, the empirical 


proportion (p*) of responders was given by 


pi ==. (1) 
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where i = 1 to k and k indicated the number of toxicant doses. 
If the characteristic response occurred in the control group (natural response) with 
proportion C, the proportions of responders were corrected using the Abbott equation for each 


treatment dose [5]: 


“=C 
pi = (2) 


The corrected proportion (pi) was then converted to a probit value (y;) [1]: 
yi = P* (pi), (3) 
which was calculated as y; = NORM.S.INV(pi) in Excel. 
A provisional regression line between y; and the logarithm of the dose (xi) was 
established: 
Vi = Ay + Poxi. (4) 
In the regression equation, i = 1 to m, where m is the number of toxicant doses at which 
the corrected proportion was not equal to 1 or 0. The intercept (œo) and slope (fo) could be 
calculated by the least-squares procedure and were retrieved using the INTERCEPT(i, xi) and 
SLOPE(y;, xi) functions, respectively, in Excel. 
We then calculated the expected probits (Y) for all dose sets, included those where the 
corrected proportion was 1 or 0: 
Y; = A + Box. (5) 
In Eq. (5), i = 1 tok. 
We next calculated the expected response proportion (Pi) for each dose set [1]. 
P, = O(¥;)*A-C)+C, (6) 
where @(Y;) returned the cumulative probability of the standard normal distribution 
corresponding to (Yi), obtainable using the NORM.S.DIST (Y;) function in Excel, and C was 
the natural response proportion, if one existed, in Eq. (2). 


The working probit (yi) was calculated from the following equation [1]: 


Pi i 
Waar (7) 
where 
1 _ 2 
Zi = on 0.5Y; š (8) 


An optimized set of expected probits was then derived from the linear regression equation 
of working probits weighted on x;, with each y; being assigned a weight, niwi, where w; was 
the weighting coefficient. This was calculated as previously described [1] 


= z? 
= z 
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where C was the natural response proportion in Eq. (2). 
The slope £ of the working probit-logio(dose) regression equation was 


k , F _G 
B = Xiz1 MWi(Xi-X)Oi-Y) (10) 


DE, niwi(xi-x)? 
The intercept a of the working probit regression equation was 
a=y- PX, (11) 
where y was the average of y and xX was the average of x: 


k k 
visi UWivi = _ Lina MiWiXi 12 

k > x= k . ( ) 
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and the standard error of a was [6] 


y= 


The standard error of f was [1] 


1 
k 
viet MW 


o(a) = + 20(B)2. (14) 


The x? statistic of the probit regression equation was [1] 


2= yk nipi- Pi)” (15) 


The significance level p of the x? statistic was calculated as the right-tailed probability of 
the chi-squared distribution (CHISQ.DIST.RT) with k — 2 degrees of freedom (d.f). 

A significant 7 statistic (p < 0.05) might indicate either that the population did not 
respond independently or that the fitted probit-log(dose) regression line did not adequately 
describe the dose-response relationship in the test samples. 

To get an optimal fit of the probit-logio(dose) regression, we substituted a and p for a 
and Bo and repeated the calculations of Eq. (5) to Eq. (15) until a stable x? appeared, 
indicating convergence. This procedure was a maximum likelihood (ML) method [1]. 


The significance of the slope was assessed using the z test [7], 


za. (16) 
o(B) 


If the absolute z-value was less than 1.96, the regression slope was not significant and the 
data were excluded from further analysis. Similarly, we might test the significance of the 
intercept (a). 

The heterogeneity factor h of the regression equation was calculated to adjust for large 7. 


h was defined as [1] 


-X 
h= 2. (17) 
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If h < 1, the model provided a good fit to the data. Otherwise, standardized residuals were 
plotted to identify outliers or other possible causes of poorness of fit [8]. Each residual 
defined the difference between the observed r; and the expected response number (n;Pi) for 
each dose. The residuals were standardized by dividing them by their standard 
errors,,/n;P;(1 — Pi) . For models providing a good fit, the standardized residuals fell mostly 
between —2 and 2 [8]. Standardized residuals distributed randomly showed no systematic 
patterns or tendencies toward positive or negative sign. 

Calculation of the lethal doses of toxicants or populations and their 95% CLs 

In this step, we first calculated the logarithms of the doses (0x) at which levels of interest 


(n) gave the expected response proportion: 
— Yna 
Or = 7” 
where yz was the n® percentile of the probit distribution curve calculated in Excel using 
NORM.S.INV(z) for the probit distribution. For example, if n = 10, 50, 90 and 99, yr was 


calculated as -1.282, 0, 1.282 and 2.326. 


(18) 


The z" lethal dose was then calculated as 
LD, = 10°. (19) 
The standard error of Or, o(8,,), was given by [1] 


=i 1 (Or)? 
Pin = BSE nw Eka niwi E) (20) 
The 95% CL of the LD; was then given as 
10ĉr+to.05,k-20 (8r), (21) 


to.os,k-2 returned the two-tailed inverse of the Student’s t-distribution at a = 0.05 with d.f. 
= k-2[T.INV.2T(0.05, k-2)]. 
The g value could be calculated to adjust if the confidence limits were valid. The g value 


was given as [9]: 


t2 2 A 
g= p ht. (22) 


If p (77) was less than 0.15, t = 1.96 and h” = 1; otherwise, hř = h and t = to.o5,k-214]. If g 


exceeded 1, the CLs for the LD, did not have practical importance [1]. 

The above steps were repeated to determine all parameters for the second toxicant for the 
same population, or the same toxicant in the second population. 

Comparison of lethal dose ratios of two toxicants or populations 

If there were / toxicants or populations in the experiment, then we compared the LDz 


values of the first (as a reference) to those of others. We first calculated the difference 
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between the log(doses) yielding the expected response proportions (x percentile) for 
toxicants or populations | and j (j=2 to l), On1;=On1 - Orj. Its standard error was given by [2] 
o(On1;) = (on)? + 0 (Ox). (23) 
The ratio of the two lethal doses was then given as 
Ratio(:) = 109-9, (24) 
and the 95% CLs were 
1 09n1jt1-960(On1j)_ (25) 
If the 95% CLs of this ratio excluded 1.0, the lethal doses of the two toxicants or 
populations were significantly different; otherwise, there was no evidence to reject the null 
hypothesis of equal LDs [2]. 
Test for parallelism of the two regression equations 
Although the above procedures did not assume equal slopes of the two regression lines, 
the specific LD;z level used depended on the parallelism of the regression lines. To examine 
parallelism of the two regression lines, we used the z-test [10]: 
|B1-B;l 


owaye+o(Rp? 


If the absolute z-value exceeded 1.96, the two regressions were non-parallel; otherwise, 


z= (26) 


they were parallel. 

Case studies 

The above procedures might be executed on an Excel (version 2010 or higher) 
spreadsheet (provided as an Additional file). To compare the results of the ML procedure in 
Excel with those of Polo-Plus and SPSS, we extracted bioassay data from the literature: (1) 
chrysanthemum aphids dosed with Rotenone, Deguelin, and a mixture of these two substances 
[11], (2) three populations, Fairfax, Pixley and Schaefer, of the pest bug "Wicked Witch of 
the West" dosed with deltamethrin [12], and (3) two populations, BugRes and BugLab, of 
Godfather larvae dosed with pyrethroid [2] (Table 1). 


Results 

Slopes, intercepts and significance testing of probit-log(dose) models fitted to the 
example data 

When we implemented the ML procedure to solve the probit-log(dose) equations for the 
three sample data in Excel, for the datasets in which there was no natural response (e.g., 
Rotenone, Deguelin, Mixture, Fairfax and Schaefer), the slope (£) and intercept (a) estimates 
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of the converged probit-log(dose) regression were identical to those calculated using 
Polo-Plus and SPSS (with two methods, SPSS! and SPSS?, to include the natural response 
proportion, C, by inputting the value of C and calculating it from the data, respectively) 
(Table 2). The standard errors of both f and a, calculated by Eq. (13) and Eq. (14), were close 
but not identical to those calculated using Polo-Plus and SPSS (Table 2). When the data sets 
included natural responses (e.g., Pixley, BugRes and BugLab), f and a, as well as their 
standard errors, were close to those produced by Polo-plus and SPSS. The results of our 
method and Polo-Plus were closer to those calculated using SPSS! method than those 
calculated using SPSS? method (Table 2, Bold items). 

The probit-log(dose) regression model assumes a linear relationship between the 
logarithm of serial doses and the probit of the response proportions. When z-tests (this study 
and SPSS) or the f-ratios (Polo-Plus) were used to evaluate the significance of the regressions, 
all z values and f-ratios for both 2 and a estimates calculated using all four methods exceeded 
1.96 (Table 2), indicating that all regression parameters were significant. If the z-value for the 
slope was less than 1.96, the regression model would be insignificant and the dataset should 
be excluded from further analysis. 

Goodness-of-fits of the probit-log(dose) regressions 

While z-tests evaluated whether a linear relationship existed between the probits and the 
log(dose), y? tests are usually used to test the goodness-of-fit of the log(dose)-probit 
regression model. For datasets that did not include natural responses, the y? and h values 
calculated in this study were identical to those calculated using Polo-Plus and SPSS (Table 3). 
When the datasets included natural responses, the y? and h values were close to those 
produced by Polo-plus and SPSS. Again, the results of our method and Polo-Plus were closer 
to those calculated using SPSS! method than those calculated using SPSS? method (Table 3, 
Bold items). 

For some datasets, y? was not significant but h was greater than 1 (Table 3). When 
standardized residuals were plotted against log(doses), one or more outliers were observed 
(outside the bounds of -2 to 2) in the Schaefer and BugLab data. For the BugLab data 
especially, the standardized residuals were not distributed randomly and showed a tendency 
toward positive sign (Figure 1), indicating that this data should be fitted using other models 
[13]. 

LD1o, LDs0, LD and LD estimates with 95% CLs 

We further compared the LD;s and their 95% CLs calculated using these four methods. 


For datasets that did not include natural responses, the LD;s calculated in this study were 
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identical to those calculated using Polo-Plus and SPSS, and the 95% CLs of LDxzs calculated 
using our method were close but not identical to those produced by Polo-Plus and SPSS 
(Table 4). For datasets that included natural responses, the LDąs and their 95% CLs were 
close but not identical to those calculated using Polo-plus and SPSS. The results of our 
method and Polo-plus were closer to those calculated using SPSS' method than those 
calculated using SPSS” method (Table 4, Bold items). 

Comparison of lethal dose ratios between two samples 

For datasets that did not include natural responses, the LD; ratios calculated using our 
method were identical to those calculated using Polo-Plus and their 95% CLs were also close. 
For datasets that included natural responses, LD; ratios and their 95% CLs calculated using 
our method were similar to those calculated using Polo-Plus (Table 5, Bold items). The LDso 
ratios and their 95% CLs calculated using our method were closer to those calculated using 
Polo-Plus than to the relative median potency (RMP) calculated using SPSS (Table 5). 

When judged by whether the 95% CLs of lethal ratios included 1.0, all methods reached 
the same conclusions for toxicity differences between two samples (Table 5). 

Comparisons of two regression slopes 

Parallelism between paired regression equations was examined using z-tests. The 
conclusions of our method for the five regression pairs were identical to those arrived at by 


Polo-Plus and SPSS, which used x? tests (Table 6). 


Discussion 

Many methods have been developed to calculate the lethal or effective doses of toxicants 
and their confidence limits. Probit analysis, developed by Bliss [14] and improved by Finney 
[11], is one such commonly-used method. To calculate the parameters of the probit-log(dose) 
regression, Finney suggested fitting the regression line by eye as precisely as possible and 
obtaining parameters, such as slopes and intercepts, of the provisional regression line at the 
first stage. Thereafter, one calculates the working probits Y, and repeats this process with the 
new set of Y values; when the iterations converge, this gives a precise estimate of the linear 
regression parameters [1]. In this study, we calculated slopes and intercepts for the provisional 
regression line by the least-squares procedure, and calculated working probits and performed 
the iteration procedure (ML) using the popular software program, Microsoft Excel. We 
obtained similar results to those obtained using Polo-Plus and SPSS. 


Several software packages, such as Polo-Plus and SPSS, might be used to calculate the 
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lethal doses and 95% CLs at different significance levels, and even test the equality of the 
lethal doses. Such professional statistical programs are difficult to handle for common 
toxicologists and environmental ecologists, and are easily abused. Excel in the Microsoft 
Office Package is the most popular statistical program around the globe. As to the Excel 
spreadsheet developed in this study, the users are easily to trace the procedure which is used 
to solve the regression equations, and calculate the CLs of a lethal dose and also the lethal 
dose ratios. They may further redevelop it easily according to their request. 

x values were used as indicators of the goodness-of-fit of the probit-log(dose) regressions 
as the iteration proceeded. The equations 


(J nw(x-X)(y-P))* 


2 — 7) — 
xX = Lnw(y — y) eGo (27) 
or 
2 _ x (r-nP? 
K= nP(1-P) (28) 


could also be applied [1]. When there were no natural responses in the datasets, these two 
equations, along with Eq. (15), gave the same results when the iterations converged, and these 
results were identical to those produced by Polo-Plus and SPSS. When the datasets included 
natural responses, Eq. (27) always gave the smallest x? value, Eq. (28) always gave the largest 
value, while Eq. (15) gave an intermediate value which was closer to the output of Polo-Plus 
and SPSS (data not shown). During iteration for some datasets, the y? values produced from 
all these three equations might increase [1]. Most regression models converged after several 
iterations, and we reported the results after 20 iterations, as this was the default maximum 
used by SPSS. 

Strictly speaking, the 95% CLs of LD; were the values of x for which the boundaries of 
the fiducial band attained the relevant value of yz. The exact CLs of 6; could be calculated by 
constructing the variance matrices of the slope (var(f)) and intercept (var(a)) and their 


covariance (cov(a,f)) matrices as follow [1, 9]: 


(a,b) t (ap)? 
Or + = (6, — orak ) + aap rro — 26,cov(a, p) + 6,7var(8) — g (var(a) — Ty ». (29) 


It has been theorized that, in practice, the method for determining 95% CLs of LD; most 
often performed sufficiently good based on a trustworthy value for the variance of Ox as 
Eq.(20) [1, 15]. It was suggested that 95% CLs of LD; could be calculated using the formula 
10%£1.960(r) [15]. The results of this equation were close to those calculated using Eq. (29) 
when the dose number (k) was large (e.g., close to 10), while the CLs were much narrow than 


those calculated exactly using Eq. (29) when k was small. By contrast, the results given by Eq. 
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(21) were nearer to those calculated exactly at different levels of k. The 95% CLs of LDz 
calculated using Polo-Plus were often identical to those calculated using SPSS when there 
was no natural response, with some exceptions (e.g., the Mixture and Fairfax data; Table 4, 
italic brackets, although the g values were not large for both of these cases). 

While it is common to find estimates of LDs obtained from probit analyses in the 
toxicology literature, it is less common to find a hypothesis test procedure to determine 
whether estimated differences between LDs are statistically significant [16]. Relative potency 
has been frequently used [1, 4], but this method assumes the regression lines being compared 
are parallel. When the regression lines were parallel, the LDs and their 95% CLs for two 
toxicants calculated from the two datasets simultaneously were similar to those calculated 
from the datasets separately. However, when the regression lines were not parallel, the LDs 
and their 95% CLs calculated from the two datasets simultaneously were quite different from 
those calculated from the datasets separately. 

In cases where the data are suggestive of a trend toward significant differences between 
LDsos, the use of non-overlapping CLs for LDso values has frequently been proposed as a 
criterion for assessing significance, while use of this criterion is thought to be conservative [17, 


18]. An alternative method involves calculating the variances of 0; using the delta-method: 
var (6x) = z [var (a) + 26,cov(a, B) + 0x var(f)], (30) 


calculating the ratio of the LDs as in Eq. (24), then calculating the 95% CLs of the ratio 
as in Eq. (25) [2]. If the 95% CLs of the ratio include 1.0, the LDs of the two samples are not 
significantly different. We followed this procedure in this study, but we calculated the 
standard error of 0;,as in Eq.(20) by the maximum likelihood procedure. We obtained 95% 
CLs of the LD ratio similar to those obtained using Polo-Plus. 

Biologically, the slope of a probit or logit regression line represents the change in the 
proportion of responders per unit change in dose. Toxicological evidence suggested that the 
slope of a dose-response regression line reflected host enzyme activity [19]. Thus, 
non-parallel lines might indicate different modes of action of the two toxicants. Parallelism 
between regression pairs was essential for determining the level at which to compare the 
effects of two toxicants. Generally, there were three main categories of parallelism: (1) the two 
regression lines were statistically parallel (e.g., Fairfax vs Pixley; Fig. 2A); (ii) the two 
regression lines were not statistically parallel but did not cross within the dominant region 
(20-80%) of the response proportions (e.g., Rotenone vs Deguelin; Fig. 2B); and (iii) the two 


regression lines crossed around the median lethal dose (e.g., BugRes vs BugLab; Fig. 2C). In 
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the first case, reporting the LDsos of the two toxicants and their ratios was sufficient. In the 
second case, one should report both LDsos and LDoos (and/or LDios) and their ratios. In the 
third case, reporting the ratios of LDios, LDsos, LDoos is meaningless, but the significance of 


difference between the two slopes should be valid. 


Conclusions 

We successfully developed a method to calculate the lethal doses of a toxicant at different 
significance levels, and compare lethal dose ratios using probit-log(dose) regression by the 
ML procedure implemented in Microsoft Excel. Lethal doses calculated using this method at 
different significance levels, as well as lethal dose ratios with their 95% CLs, were identical 
or close to those calculated using Polo-Plus and SPSS. When judged by whether the 95% CLs 
of the lethal ratios included 1.0, all methods reached the same conclusions regarding toxicity 


differences between two samples. 
Abbreviations 
LDso: Median lethal dose; 95% CLs: 95% confidence limits; RMP: relative median potency; 


ML: maximum likelihood. 
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Table 1 Selected bioassay data for toxicants in experimental populations 


Toxicant Population Population 
jiii Dose n* a cia Dose n r 2i Dose n r 

Rotenone 2.6 50 6 Fairfax 0 30 0 BugRes 0 60 3 
3.8 48 16 2 48 12 3 60 9 
5.1 46 24 3 50 15 10 60 19 
RT 49 42 5 50 31l 20 60 32 
10.2 50 44 7 48 31 40 60 38 

Deguelin 5.1 49 16 10 59 52 50 60 46 
10.0 48 18 Schaefer 0 60 0 BugLab 0 60 5 
20.4 48 34 60 15 0.03 30 7 
30.2 49 47 3 120 41 0.1 30 7 
40.7 50 47 5 60 39 0.3 30 6 
50.1 48 48 10 120 110 1 30 3 

Mixture 2.5 47 7 50 120 119 3 30 3 
5.1 46 22 Pixley 0 359 7 7 30 10 
10.0 46 27 10 70 22 10 60 32 
15.1 48 38 20 70 38 15 30 22 
20.4 46 43 30 50 38 20 30 30 
25.1 50 48 50 50 48 


* n was the total number of subjects administrated at each dose. 


$ 


& “Mixture” was a mixture of Rotenone and Deguelin at 1:1. 


r was the number of subjects exhibited a characteristic response to each dose. 


429 Table 2 Slopes, intercepts and results of significance testing for the example data fitted to the 


430  probit-log(dose) regression models using the ML procedure (Excel), Polo-Plus and SPSS 


Example Estimates * Standard error (o) z’ 
Excel Polo* SPSS! SPSS? Excel Polo SPSS! SPSS? Excel Polo* SPSS! SPSS? 
b Rotenone 4.213 4.213 4.213 4.213 0.481 0.478 0.478 0.478 8.767 8.809 8.809 8.809 
Deguelin 2.633 2.633 2.633 2.633 0.279 0.279 0.279 0.279 9.434 9.421 9.421 9.421 
Mixture 2.533 2.533 2.533 2.533 0.269 0.272 0.272 0.272 9.400 9.320 9.320 9.320 
Fairfax 2.598 2.598 2.598 2.598 0.352 0.353 0.353 0.353 7.370 7.369 7.369 7.369 
Schaefer 2.812 2.812 2.812 2.812 0.281 0.273 0.273 0.273 9.999 10.282 10.282 10.282 
Pixley 7 2.982 2.917 2.915 4.897 0.401 0.402 0.401 1.200 9.999 7.248 7.264 4.080 
BugRes 1.730 1.551 1.545 1.703 0.270 0.252 0.229 0.532 6.402 6.148 6.736 3.202 
BugLab 5.541 5.461 4.941 3.631 0.960 1.062 0.948 0.716 5.771 5.142 5.215 5.071 
a Rotenone -2.887 -2.887 -2.887 -2.887 0.351 0.350 0.350 0.350 -8.225 -8.247 -8.247 -8.247 
Deguelin -2.622 -2.622 -2.622 -2.622 0.342 0.339 0.339 0.339 -7.670 -7.743 -7.143 -7.143 
Mixture -2.036 -2.036 -2.036 -2.036 0.271 0.272 0.272 0.272 -7.519 -7.491 -7.491 -7.491 
Fairfax -1.603 -1.603 -1.603 -1.603 0.250 0.249 0.249 0.249 -6.413 -6.435 -6.435 -6.435 
Schaefer -1.622 -1.622 -1.622 -1.622 0.190 0.186 0.186 0.186 -8.530 -8.728 -8.728 -8.728 
Pixley : -3.666 -3.556 -3.552 -6.778 0.531 0.529 0.527 1.832 -6.903 -6.719 -6.741 -3.699 
BugRes -2.387 -2.064 -2.053 -2.338 0.384 0.367 0.315 0.908 -6.218 -5.618 -6.512 -2.575 
BugLab -5.690 -5.587 -4.935 -3.640 1.028 1.141 0.997 0.754 -5.535 -4.897 -4.951 -4.826 

431 # SPSS includes the natural responses proportion (C) by two methods: 1, inputting the value of C; and 


432 2, calculating C from the data. The d.f. = k -2 in method 1, while it was k-3 in method 2. 

433 $ Polo-Plus used the t-ratio to test the significance of the linear regression. The significance criterion 
434 for the t-ratio (a = 0.05) was 1.96 (t-distribution with df= °°). This significance level was identical to that 
435 ofthe z test. 


436 “Bold items indicated the data sets included natural responses. 


437 
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Table 3 Goodness-of-fit of the probit-log(dose) regression models calculated from the 


example data using the ML procedure (Excel), Polo-Plus and SPSS 


K hs g“ 
Examples 

Excel Polot SPSS! SPSS? Excel Polot SPSS! SPSS? LS 
Rotenone 1.729 1.729 1.729 1.729 0.576 0.576 0.576 0.576 0.050 
Deguelin 12.026" 12.026" 12.026" 12.026° 3.006 3.006 3.006 3.006 0.260 
Mixture 4.995 4.995 4.995 4.995 1.249 1.249 1.249 1.249 0.043 
Fairfax 3.754 3.754 3.754 3.754 1.251 1.251 1.251 1.251 0.071 
Schaefer 11.384" 11.384" 11.384" 11.384" 3.795 3.795 3.795 3.795 0.384 
Pixley “ 2.671 2.708 2.712 0.064 1.335 1.354 1.356 0.032 0.069 
BugRes 1.382 1.358 1.362 1.266 0.461 0.453 0.454 0.633 0.094 
BugLab 13.555 11.081 27.454 10.181 1.936 1.583 3.922 1.697 0.325 


$h, heterogeneity factor (see Eq.(17)). SPSS did not give h. To compare the results from this study and 


Polo-Plus, it was shown as h = y7/d.f. here. 


* The g value was calculated as Eq.(22). Polo-Plus and SPSS did not calculate the g values. 


* %7 indicated the goodness-of-fit test was significant at a = 0.05. 


“Bold items indicated the data sets included natural responses. 
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Table 4. LD10, LD50, LDoo and LDoo values with their 95% CLs for the example data fitted to 
probit-log(dose) regression models using the ML procedure (Excel), Polo-Plus and SPSS 
Interested LDx (95% CLs) 
Samples 

levels (x) Excel Polo-Plus SPSS! SPSS? 
Rotenone 2.405 (1.756, 3.295) 2.405 (1.889, 2.833) 2.405 (1.889, 2.833) 2.405 (1.889, 2.833) 
Deguelin 3.229 (1.945, 5.360) 3.229 (0.606, 5.915) 3.229 (0.606, 5.915) 3.229 (0.606, 5.915) 
Mixture 1.986 (1.209, 3.263) 1.986 (0.889, 3.059) * 1.986 (1.286, 2.672) 1.986 (1.286, 2.672) 

10 Fairfax 1.329 (0.736, 2.400) 1.329 (0.392, 2.112) 1.329 (0.820, 1.782) 1.329 (0.820, 1.782) 
Schaefer 1.321 (0.872, 2.001) 1.321 (0.207, 2.247) 1.321 (0.207, 2.247) 1.321 (0.207, 2.247) 
Pixley “ 6.307 (3.011, 13.210) 6.022 (0.393, 10.588) 6.011 (3.765, 7.969) 13.252 (5.512, 18.430) 
BugRes 4.355 (1.721, 11.023) 3.194 (1.143, 5.583) 3.157 (1.373, 5.105) 4.174 (0.082, 11.078) 
BugLab 6.246 (4.714, 8.275) 6.145 (2.450, 8.105) 5.488 (0.011, 8.109) 4.461 (0.927, 6.696) 
Rotenone 4.845 (4.122, 5.696) 4.845 (4.363, 5.354) 4.846 (4.363, 5.354) 4.846 (4.363, 5.354) 
Deguelin 9.905 (7.658, 12.812) 9.905 (5.090, 14.626) 9.905 (5.090, 14.626) 9.905 (5.090, 14.626) 
Mixture 6.366 (4.981, 8.135) 6.366 (4.564, 8.187) 6.366 (5.254, 7.484) 6.366 (5.254, 7.484) 

50 Fairfax 4.139 (3.240, 5.288) 4.139 (2.926, 5.482) 4.139 (3.511, 4.800) 4.139 (3.511, 4.800) 
Schaefer 3.773 (3.110, 4.579) 3.773 (2.198, 5.717) 3.773 (2.198, 5.717) 3.773 (2.198, 5.717) 
Pixley “ 16.967 (12.284, 23.436) 16.559 (8.096, 24.636) 16.544 (13.963, 19.082) 24.208 (16.712, 29.114) 
BugRes 23.981 (16.593, 34.658) 21.413 (11. 546, 28.362) 21.318 (16.502, 27.590) 23.612 (6.574, 35.519) 
BugLab 10.638 (9.336, 12.121) 10.548 (7.912, 12.738) 9.971 (2.962, 14.238) 10.054 (6.699, 13.602) 
Rotenone 9.761 (7.323, 13.011) 9.761 (8.405, 12.134) 9.762 (8.405, 12.134) 9.762 (8.405, 12.134) 
Deguelin 30.381 (22.388, 41.228) 30.381 (19.950, 77.517) 30.381 (19.950, 77.517) 30.381 (19.950, 77.517) 
Mixture 20.407 (14.636, 28.454) 20.407 (15.015, 34.190) 20.407 (16.596, 27.120) 20.407 (16.596, 27.120) 

90 Fairfax 12.892 (7.803, 21.299) 12.892 (8.611, 36.089) 12.892 (10.006, 19.424) 12.892 (10.006, 19.424) 
Schaefer 10.777 (7.559, 15.365) 10.777 (6.747, 50.379) 10.777 (6.747, 50.379 ) 10.777 (6.747, 50.379 ) 
Pixley “ 45.645 (25.980, 80.196) 45.538 (28.964, 329.883) 45.533 (36.541, 64.751) 44.222 (36.854, 63.231) 
BugRes 132.040 (52.601, 331.448) 143.532 (88.364, 344.840) 143.975 (88.678, 333.43) 133.577 (82.497, 723.399) 
BugLab 18.118 (14.484, 22.665) 18.108 (14.508, 35.264) 18.118 (13.196, 1530.98) 22.662 (15.855, 84.406) 
Rotenone 17.278 (10.761, 27.743) 17.278 (13.588, 24.958) 17.278 (13.588, 24.958) 9.762 (8.405, 12.134) 
Deguelin 75.759 (44.790, 128.141) 75.759 (39.827, 460.545) 75.759 (39.827, 460.545) 75.759 (39.827, 460.545) 
Mixture 52.753 (29.785, 93.433) 52.753 (32.074, 135.526) 52.753 (37.441, 87.710) 52.753 (37.441, 87.710) 
Fairfax 32.548 (13.574, 78.046) 32.548 (16.589, 209.890) 32.548 (21.149, 67.448) 32.548 (21.149, 67.448) 

99 Schaefer 25.356 (13.882, 46.314) 25.356 (12.119, 412.504) 25.356 (12.119, 412.504) 25.356 (12.119, 412.504) 
Pixley “ 102.28 (38.072, 274.763) 103.882 (49.732,4503.346) 103.939 (71.350,196.711) 72.273 (53.911,155.013) 
BugRes 530.489 (109.45, 2571.23) 676.988 (295.27, 3261.06) 683.244 (302.10, 2931.66) 548.646 (209.66, 26126.13) 
BugLab 27.97 (19.047, 41.067) 28.133 (19.726, 97.529) — 29.481(17.762, 174201.0) 43.958 (24.635, 485.621) 


*Data in italic brackets indicated that he 95% CLs of LDz calculated using Polo-Plus were not identical to 
those calculated using SPSS. 
«Bold items indicated the data sets included natural responses. 
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Table 5. Lethal dose ratios for the examples fitted to the probit-log(dose) regression models 


calculated by the ML procedure (Excel), Polo-Plus and SPSS 


Interested Lethal ratio (95%CL) RMP (95%CL) # 
Comparison 
levels (x) Excel Polo-Plus SPSS? 
Rotenone/Deguelin 0.745 (0.496, 1.119) 0.745 (0.494, 1.122) 
Rotenone/Mixture 1.211 (0.812, 1.808) 1.211 (0.805, 1.824) 
10 Fairfax/Scheafer 1.006 (0.645, 1.569) 1.006 (0.642, 1.577) 


Fairfax/Pixley $ 


BugRes/BugLab 


0.211 (0.128, 0.346) 


0.697 (0.376, 1.293) 


0.221 (0.132, 0.369) 


0.520 (0.238, 1.138) 


Rotenone/Deguelin 

Rotenone/Mixture 
50 Fairfax/Scheafer 

Fairfax/Pixley a 


BugRes/BugLab 


0.489 (0.398, 0.602) 
0.761 (0.623, 0.929) 
1.097 (0.905, 1.329) 
0.244 (0.198, 0.301) 


2.254 (1.753, 2.898) 


0.489 (0.397, 0.603) 
0.761 (0.621, 0.933) 
1.097 (0.902, 1.335) 
0.250 (0.201, 0.311) 


2.030 (1.478, 2.787) 


0.455 (0.173, 0.793) 


0.710 (0.440, 1.005) 


1.106 (0.811, 1.550) 


0.261 (0.045, 0.571) 


3.898 (0.455, 4701.677) 


Rotenone/Deguelin 
Rotenone/Mixture 
90 Fairfax/Scheafer 


Fairfax/Pixley a 


0.321 (0.243, 0.425) 
0.478 (0.357, 0.642) 
1.196 (0.819, 1.747) 


0.282 (0.189, 0.422) 


0.321 (0.241, 0.428) 


0.478 (0.354, 0.646) 


1.196 (0.814, 1.758) 


0.283(0.186, 0.430) 


BugRes/BugLab 7.288 (4.014, 13.232) 7.926 (4.077, 15.412) 
Rotenone/Deguelin 0.228 (0.142, 0.366) 0.228 (0.140, 0.371) 
Rotenone/Mixture 0.328 (0.199, 0.539) 0.328 (0.197, 0.546) 
99 Fairfax/Scheafer 1.284 (0.667, 2.469) 1.284 (0.661, 2.493) 
Fairfax/Pixley 0.318 (0.158, 0.642) 0.313 (0.151, 0.651) 
BugRes/BugLab 18.968 (6.820, 52.753) 24.064 (7.520, 77.007) 


# RMP, relative median potency. We did not show the RMP of SPSS by inputting C methods because of 


different C values in the two samples. 


“Bold items indicated the data sets included natural responses in the control group. 


19 


463 
464 


465 
466 
467 


468 


Table 6. Tests of parallelism between the probit-log(dose) regression lines calculated using 


the ML procedure (Excel), Polo-Plus and SPSS 


Comparison Excel Polo-Plus SPSS” 
Z Parallelism X df=v) Parallelism Xas- Parallelism 
Rotenone vs Deguelin 2.844 Rejected 8.41 Rejected 10.216 Rejected 
Rotenone vs Mixture 3.049 Rejected 9.68 Rejected 9.284 Rejected 
Fairfax vs Scheafer 0.475 Accepted 0.23 Accepted 0.000 Accepted 
Fairfax vs Pixley 0.720 Accepted 0.36 Accepted 0.598 Accepted 
BugRes vs BugLab 3.821 Rejected 22.10 Rejected 24.840 Rejected 


# We did not compare parallelism among the regression lines calculated by SPSS by inputting C methods 


because of different C values in the two samples. 
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Figure legends 


Fig. 1 Standardized residuals versus log(doses) after fitting the Schaefer (A) and Buglab (B) 
dataset to probit-log(dose) models 


Fig. 2 The three categories of parallelism between two regression lines 


(A) Fairfax vs Pixley; (B) Rotenone vs Deguelin; (C) BugRes vs BugLab 


Additional file 
Calculation of LDs and their ratios.xlsx (344 kb), which requires Microsoft Excel 2010 or 


higher. It is available via a link: https://figshare.com/s/f94393f752fcc 1 5faea7. 
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